Runtime Systems and Tools:
Heterogeneous Computing
People
Papers/Talks
22-05
2022
[Paper]
Improving Scalability with GPU-Aware Asynchronous Tasks [HIPS 2022]
21-02
2021
[Poster]
CharminG: A Scalable GPU-resident Runtime System [HPDC 2021]
20-04
2020
[Paper]
Achieving Computation-Communication Overlap with Overdecomposition on GPU Systems [ESPM2 2020]
20-02
2020
[Paper]
End-to-end Performance Modeling of Distributed GPU Applications [ICS 2020]
19-06
2019
[Poster]
ACM SRC: Fast Profiling-based Performance Modeling of Distributed GPU Applications [SC 2019]
17-11
2017
[Poster]
ACM SRC: Runtime Support for Concurrent Execution of Overdecomposed Heterogeneous Tasks [SC 2017]
16-14
2016
[Paper]
Runtime Coordinated Heterogeneous Tasks in Charm++ [ESPM2 2016]
12-06
2012
[Paper]
Dynamic Scheduling for Work Agglomeration on Heterogeneous Clusters [Workshop on Multicore and GPU Programming Models, Languages and Compilers at IPDPS 2012]
10-16
2010
[Paper]
Scaling Hierarchical N-Body Simulations on GPU Clusters [SC 2010]
09-09
2009
[Paper]
Towards a Framework for Abstracting Accelerators in Parallel Applications: Experience with Cell [SC 2009]
09-06
2009
[Paper]
Flexible Hardware Mapping for Finite Element Simulations on Hybrid CPU / GPU Clusters [SAAHPC 2009]
08-12
2008
[MS Thesis]
An Application Programming Interface for General Purpose Graphics Processing Units in an Asynchronous Runtime System [Thesis 2008]
06-20
2006
[Poster]
Charm++ on Cell [PPL Poster 2006]
06-19
2006
[Poster]
Charm++ Simplifies Programming for the Cell Processor [SC 2006]